Universal Properties of Growing Networks 

P. L. KrapivskJ*] 

Center for Polymer Studies and Department of Physics, 
Boston University, Boston, MA 02215, USA 

B. Derrida 

Laboratoire de Physique Statistique, Ecole Normale Superieure, 
24 rue Lhomond, 75231 Paris Cedex 05, France 

Abstract 

Networks growing according to the rule that every new node has a probability p^. of being 
attached to k preexisting nodes, have a universal phase diagram and exhibit power law decays 
of the distribution of cluster sizes in the non-percolating phase. The percolation transition is 
continuous but of infinite order and the size of the giant component is infinitely differentiable at 
the transition (though of course non-analytic). At the transition the average cluster size (of the 
finite components) is discontinuous. 
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This work is dedicated to the memory of Per Bak whose scientific work had an oustanding 
influence on the development of Statistical Physics and of the Theory of Complex Systems. 



I. INTRODUCTION 

A number of recent works have shown that simple models of growing 

networks exibit an unexpected degree of universality with a percolation transition of infinite 
order. The goal of the present paper is to summarize and extend these recent results, and to 
show how the general case of a growing network can be understood via a simple theoretical 
approach based on the analysis of the differential equation (fTB|) satisfied by the generating 
function of distribution of cluster sizes. 

Perhaps the most studied 0, 0] random network model is the random graph introduced 
by Erdos and Renyi where each pair of vertices of a graph of vertices are directly 
connected with a probability c/N . The Erdos and Renyi random graph exhibits a percolation 
transition at c = 1 and the number NG of vertices in the giant component vanishes linearly 
at the transition (G ~ (c — 1) as c ^ 1). Therefore it came as a surprise that the percolation 
transition in simple models of growing random networks is infinitelv gentle, namely every 
derivative of G vanishes at the transition. Subsequent recent works 

mm 

confirmed 

that prediction for the original and more complicated models, and additionally demonstrated 
another surprising feature: the size distribution of the connected components has a power 



law decay in the non-percolating phase. This and ot 



ler features are very reminiscent of the 



12| |: the cause of this similarity remains 



Berezinskii-Kosterlitz-Thouless phase transition 
not understood. 

The growing networks which exhibit this infinite order transition are constructed as fol- 
lows: one starts with a single node and one adds new nodes one at each time step. When the 
network consists of N nodes and the 1st node is added, this new node is connected to k 
randomly chosen existing nodes among the first nodes. Thus, with probability p^, there 
are k arrows emerging from the A^ + 1st node, the targets of which are k nodes chosen at 
random among the A^ first nodes (whether we require the k target nodes to be distinct or not 
has no effect on the large A^ properties that we discuss here). The parameters which define 
the model are the probabilities pk- Natural questions one can ask about such a network are 
whether it exhibits a percolation transition with the emergence of an infinite component and 



what are the critical behaviors associated with this transition. For a Poisson distribution 
p. ^ l^e-. the mode, has been investigagd in QQ. and the case of genera, p.. was also 

recently studied by Coulomb and Bauer p who obtained a number of detailed results, in 
particular on the correlation between a cluster size and the times at which the vertices of a 
cluster were added. 



Similar models have also been studied in the mathematical literature 



HQ, 



and several 



properties of the percolation transition have been proved by Durrett [l5[ and BoUobas et 



ql 



16f . The recent interest was partly driven by the desire to mimic biological networks 
181 1: the Poisson network arose as the limiting case of the protein interaction network 

n 

and as a toy model of a regulatory network p. 



II. THE PHASE DIAGRAM AND ITS CRITICAL BEHAVIORS 

In this section, we summarize the properties of these growing networks, as derived in 
sections 3-6. The first result which emerges from the analysis of the model is that the phase 
diagram as well as all the power laws depend only on two parameters, the first two moments 
of the distribution 

p = (k) = J2k pk and A = {P) - {kf (1) 

k 

all the other parameters being irrelevant. 

Because the random variable k is an integer, the variance A is not only positive but it 
satisfies 

A>(/?-[/5])-(/?-[/?])2 (2) 

where is the integer part of (3: for a non-integer average /3, a distribution concentrated on 
integers has a strictly positive variance A. The distribution which minimizes A, at fixed (3, 
is concentrated on the two integers and [P] + 1 with pip] = 1 — P + and P[/3]+i = (3 —[P] 
leading to ©. 

Let us summarize the main properties of the phase diagram: 
1. The non-percolating phase is the region 

P-P'<A<^ for /?<i (3) 
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In this non-percolating phase, the density Cg of clusters of size s decays for large s like 
a power law with an exponent r which varies continuously with A 

2 

Cs ~ s~'^ with r = 1 H , (4) 

1 - VI - 4A ^ ^ 

2. The percolation transition line is given by 

A = i and /?<^ (5) 

along which for large s 

~ (1-2/3)2 s3 (lns)2 

3. r/ie critical boundary line is 

A = /3 - and ^ < /3 < 1 (7) 

along which for large s, 

Cs ~ s^'"^ (8) 

4. The percolating phase covers the rest of the phase diagram, namely 

(3 <\ and A > i 

I < /3 < 1 and A > /5 - (9) 
1 < (3 and A arbitrary 

where Cg decays exponentially. As one approaches the boundaries of the percolating 
phase, the fraction G of sites in the giant component vanishes in the following ways 

0</3<i and A^l: G ~ exp (^--^=^^ (10) 

(3=- and A ^ i : G ~ , exp , (11) 

2 4 V4A- 1 2V4A - 1 ; ^ ' 

^ < /5 < 1 and A /5 - : G ~ (A - /5 + /52)^ (12) 

These infinite order transitions and continuously varying exponents are reminisecent of 
the Berezinskii-Kosterlitz-Thouless transition [ill ll^. 
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FIG. 1: The phase diagram 
III. THE GENERATING FUNCTION OF CLUSTER SIZES 

When becomes large one can show jsl] that the total number of clusters Ag of clusters 
of size s becomes extensive (As ~ Ncg) with fluctuations of order of order A^^/^. Therefore 
for large A^, the random variable Ag is well characterized by its average (Ag) which evolves 
according to 



d{Ag) 



s{Ag) 



l...Sft j = l 



(13) 



A:=0 si...Sftj=l 

where the sum is taken over all Si > 1, . . . ,Sk > 1 such that Si + ■ ■ ■ + + 1 = s. On 
the right-hand side of (jl3p the negative term correponds to the decrease of the number of 
clusters of size s, when a new node is introduced which connects a cluster of size s to other 
sites, creating that way a new cluster of size large than s. The positive terms corresponds to 
all the ways the new node can make up a cluster of size s by connecting preexisting clusters. 
Writing (Ag) = Ncg we reduce Eqs. (fT^ to 

oo k 

(1 + f3s)cg = J2PkJ2]l ""^r (14) 

fc=0 si...Sfej=l 

This allows one to determine all the Cg recursively: Ci= Pq/{1 + [3),C2 = PiCi/ (1 + 2/3), C3 = 
(piC2 +P2cf)/(1 + 3/3), etc... The infinite set of Eqs. (IT^ can be converted into a single 
differential equation for the generating function 



E 

s=l 



sc. e 



(15) 



Multiplying both sides of Eq. ()14p by s e^^ and summing over s gives 

g + (3g' = {V[g] + g' Q[g]) , (16) 

where g' = dg/dz, the function V[g] is the generating function of the pkS and Q[g] its 
derivative: 

V[g]=J2p^9\ Ql9] = ^. (17) 

fe>0 ^ 



Up to a change of notation, Eq. (fT^ is the same as Eq. (17) of Ref. j6|. We will see below 
that all the properties summarized in section 2 follow from (|T6|) . 

IV. THE AVERAGE CLUSTER SIZE 

If a node is chosen at random (ouside the giant component) the average size (s) of the 
cluster it belongs to is given by 

, , S^^c. j'(o) 
= = ^ '''' 

The region without the giant component is characterized mathematically by ^'(0) = 1. If 
we replace g{z) by its expansion g{z) = 1 + g'{0)z + o{z) in Eq. ^T^\ . and use that 

V[g] = l + Pig-l) + ^^{g-lf + ... (19) 

with 

-f = A + (3^-/3 (20) 
we obtain a quadratic equation satisfied by g'{0) 

{A + P'- P)[g'm' + (2/5 - l)g'{0) + 1 = 0. (21) 

Equation ^IT\ has no positive solution when /9 > | [see ©J. For /3 < | and A < i, the 
solution to Eq. (PT|) reads 



^''^^"'"'^ 2(A + g^-/3) = l-2,3+Vl-4A - '''^ 
This suggests that different behaviors occur below, at, and above the critical line (jS)). We 
shall see in section 6 that the giant component is born when the critical line is crossed. 
When the line © is approached from below, i.e. A — > ^ — 0, one sees that 



Turn now to the phase with the giant component. Setting z = 0, Eq. ()16p gives g'{0) in 
terms of the size G = 1 — g{0) of the giant component 

3^^>- p_Qii_G] ■ ^^^^ 

There is no exphcit formula for the average size in the percolating phase due to the lack of 
explicit expression for G. However, near the critical line (0) one can use the fact that G 
to simplify Eq. (j2D) and get as A ^ ^ + 

M : 1-/^ 2 2-2^ (25) 

^ ' A + (3^ -(3 1-2/3 1-2/5 ^ ^ 

Comparing ()23p25j) shows that the average size (s) of finite components jumps disco ntinu- 
ously as one crosses the percolation transition line (jSj). 

V. COMPONENT SIZE DISTRIBUTION 

We now examine the component size distribution c^. 

A. Sub-critical regime 13 < \ and A < | 

The large s behavior of Cg can be read off from the behavior of the generating function 
(fTK|) in the z ^ Q limit. A power law decay (see Appendix B) 

Cs ~ Bs^'^ as s ^ oo (26) 

implies the following small z expansion of the generating function (|T^ : 

g{z) = 1 + g\Q)z + 5 r(2 - t){-zY-^ + .... (27) 

Substituting this expansion into Eq. ()l(i|l or rather into 

g-e'V[g]=g' {e^Q[g]-P) (28) 

we find by balancing the contributions of the order of {—zY^"^ 



_ ^ (3-1 + [A + (3^- (3)g'{Q) 
/3+(A + /32-/3)^'(0) 



which using gives (jH). 



Thus in contrast to ordinary critical phenomena, one gets a power-law ()26|) in the whole 
non-percolating phase and the exponent r given in (j^ depends continuously on the single 
parameter A. 

The amplitude B in (j26j) is, as usual for critical amplitudes, more difficult to determine. 
It can nevertheless be calculated along the line 

A = P-P^ and p <]- (30) 

where only po and pi are non-zero. Then the recursion (jl4j) reduces to 
(1 -|- /3s)cs = /3(s — l)cs__i which is easily solved 



i_/?r(i + i)r(.) i_^r(i + i: 



for large s (31) 



/? r(5 + i + i) /3 

As claimed in (jSJ, expressions ()31|) remain valid along the critical boundary line (|7j), 
where only two probabilities po and pi are non-zero. Note (see Appendix B) that for small 
z and /3 > |, one has 

,(.)^i + i^r(i + i)r(i-i)(-.)^' + ... (32) 

At /? = |, equation (jSU becomes Cs = 2/[s(s + l)(s + 2)] leading to 

g = 2e~^ — 1 — 2e^^(l — e~^) log(e~^ — 1) which gives for small negative z 

g = 1 -2zlog{-z) -2z + ... (33) 

B. Critical line < /3 < ^ and A = i 

We write as in Appendix A 

g{z) = l + zv{z) (34) 
and we obtain ()A6|) an implicit form of v{z) 

HI - (1/2 + + = MP) m 

The integration constant A{(3) cannot be determined without integrating the full equation 
(fTH|l with appropriate boundary condition: g Cie^ with Ci = as 2; —00. 
From ()34f35|l . we get the small z of g{z): 

^(") = ' + r^"+(r^M^ + ---- 



Inverting this expansion (see Appendix B) yields 

Thus, the component size distribution acquires a logarithmic correction in the critical 
regime with a remarkable degree of universality: it depends only on the average connectivity 
(3 of the network. 



C. Super-critical regime < /3 < ^ and A > | 

Above the phase transition point, both (7(0) = 1 — G and g'{Q) are finite. Repeatedly 
differentiating Eq. (jlfjj) and setting 2; = we find that all following derivatives are finite as 
well. This implies that for A > 1/4, the component size distribution decays faster than any 
power law. We now argue that 

C, ~ s-5/2g-s/s* s^OO. (37) 

An exponential factor in the component size distribution, oc e~^/^* , shifts a singularity 
of the generating function g[z) to z^, = From Eq. (fTB|) which is useful to re-write in 

the form 

we see that the singularity arises when the denominator on the right-hand side of ()H8|I 
vanishes: /3 = e^* Q\g^ where = g{Z:^). The derivative on the left-hand side of (jHH)) 
should be therefore singular suggesting an algebraic asymptotic g{z) — g^ oc {z^ — 2;)" with 
a < 1. Plugging this into ()38p gives a = 1/2. This type of singularity corresponds to 
the s~'^/^ decay of the sequence sCsC^^* (Appendix B) and hence we finally obtain (jHTj). A 
more detailed^ analysis would allow to see that ~ 1/G diverges as the percolation line is 
approached 



VI. GIANT COMPONENT 



To determine the size of the giant component G we need to understand the behavior of 
solutions to Eq. (fT^ near z = and this can be done analytically near the transition line 
or near the critical boundary line (see Appendix A). 
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A. For < /? < i and A > i 

If we write g{z) as sete = A — 1/4-Cl and we use expression ()A5|) for v{z), we get 



ln(l - {1/2- p)v) +\n{-z) 



TT 



+ 



1 



In 



(39) 



2^e {l-2f3)[l-{l/2-[3)v] 

For this solution to be consistent with ()35|1 in the hmit e — > the size G of the giant 
component should satisfy 



~ z — exp 



TT 



exp 



TT 



(40) 



1-2/5 "I 2v/^J 1-2/5 V4A - 1 

as claimed in (fTn|) . Therefore the transition is of infinite order since all derivatives of G vanish 
as A ^ 1/4. The behavior (HI]) appears quite universal as it was observed numerically 
and confirmed analytically [2,5 13,0,0,01 for other growing networks. 



B. /5 = i and A > 1 



In this region we may use either ()A4|) or ()A5|) to obtain for small 7 



(41) 



To be consistent with (jSBI), i-e. with v = —2 — 21n(— z) in the limit 7 = A — 1/4 0, 
the size of the giant component should statisfy 

^-1 



GM ~ — exp 

V7 



vr 



(42) 



as claimed in ()11|). In comparison with pn|) there is a factor 1/2 in the exponential and the 
prefactor is determined here analytically. The transition still is of infinite order. 

C. 1 < /? < 1 and A > /3 - /32 



In this range of parameters one can use ()A3|) . When one approaches the critical boundary 
line (|7j), i.e. for small 7, expression ()A3|) becomes 



^ ln[l + (2/5-l)t;]+ln(-2) =ln|7-^G| + ^''^^^ 



2/5-1 



2P-1 



(43) 
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To be consistent with 



;i-/?)r (i + ^)r(i-^ 



/3 TT 



for small z, one needs the size G of the giant component to scale as 

G(/3,7)~/^7^ ^=^^^ (45) 

p 1-/3 

where = C 2/5-1 (2/3 — 1) 2/3-1 and C is given by ()44|) . The exponent z/ is again universal 
as it depends only on the average connectivity (3 but not on any other parameter of the 
distribution p^. Yet it varies continuously with /? along the critical boundary hne ((Zj) thereby 
showing a richer behavior than in random graph models where u = 1. 



APPENDIX A: ANALYSIS OF FOR z SMALL AND 1 - g SMALL 

In this appendix we analyze the solution of (fTBjl when both z and 1 — g are small. If we 
write 

g{z) = l + zv{z) , (Al) 

equation (|T6|) gives to leading order 

{1-V + 2(3v + '^v^)z + (;5 '^v)v'z^ = 

so that v{z) satisfies 

■yv + 3 , dz , . , 

' ^ dv + — = 0. (A2) 



7t;2 + (2/? -l)v + l z 
If the size G of the giant component is non-zero, lim^^o-2^ = g{0) — 1 = —G and this fixes 

the constant of integration and leads to the following expression valid for (3 > 1/2, 7 small 

and 7 < (/3 - 1/2)2; 

^ ln[7t;2 + (2/5 - l)v + 1] + \n{-z) (A3) 

' 2^-1 + 27^+v^^^ ^ 

2v/(2/?- 1)2-47 2/5 - 1 + 27t; - vW^^F^^ 
If one tries to analytic continue ()A3|) to the region /? > 1/2, 7 small and 7 > (/3 — 1/2)^ one 
gets 

^ ln[7t;2 + (2/5 - 1)^; + 1] + \n{-z) (A4) 

2^47 - (2/5 - 1)2 2/5 - 1 + 27W - ^^47 - (2/? - 1)2 
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and to the region /3 < 1/2, 7 small and 7 > (/3 — 1/2)^ one gets (the extra term coming 
from the continuation of the logarithm) 

\ H^v' - (1 - 2(3)v + 1] + In(-z) - , ^^^^ (A5) 

2 v^47 - (1 - 2/3)2 



2^47- (1-2/5)2 1 - 2/3 - 271; + iv/47 - (1 - 2/5)2 
Lastly when G = and 7 = (/5 — 1/2)2 ^^^^ g^^g integrating ()A2 



- (1/2 - + ln(-.) + = (A6) 

where A{(3) is an integration constant. 



APPENDIX B: EXTRACTING THE ASYMPTOTICS 

Consider the generating function 

00 

A{z) = J2(^se''. (Bl) 

s=l 

The dominant singularity of the generating function allows one to extract the large s asymp- 
totic of ag. This can be done by a variety of techniques Q|- Here we give an elementary 
exposition that is sufficient to extract the asymptotics used in this paper. 

Let us first see what kind of the singular behavior is associated with the power-law 
asymptotic 

as^As" as s ^ 00. (B2) 

If a > — 1, then A{z) diverges as 2 t 0. The dominant contribution is found by replacing 
summation by integration: 



A{z)^ dsAs'^e'' = AT{l + a){-z)-^-'' . (B3) 
Jo 

If — 2 < a < —1, the sum Ylis>i converges and instead of ()B3|) we get 

A{z) = A{0) + Ar{l + a) (-z)-^-" + . . . . (B4) 
Similarly for — 3 < a < —2, 

A{z) = A{0) + A'{0)z + Ar{l + a) {-z)-^-" + ... (B5) 
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where of course A{0) = ^^y^ and A'{0) = J2s>i ^^s- Thus if the dominant singular term 
has the power-law form {—z)^^~°' y4r(l + a), the asymptotic must have the form ()B2|1 . 

Above we assumed that a is not a negative integer. Otherwise logarithms can arise. For 
instance, if a = —1 we get 



oo 

— 1 „sz 



A{z)^J dsAs-'e'' ^ ~A\n{~z). (B6) 

Imagine now that the dominant singular term is a power of ln{—z). It is tempting to test 
the asymptotic 

tts ^ A {In s)'" as s ^ oo. (B7) 

For a < 1, it leads to 

, r'/(-^) ds ^ \{l-a)-^[-\n{~z)Y-^ a<l- 

A{z) ^ A / — — - Ayi { B8 

J ^^1^^)" [ln[-ln(-.)] a = l. 

For a > 1, the sum X]s>i '^s converges and we get 

A{z) = A{0) - ^ [- ln(-;.)]^-'^ + • • • • (B9) 

As an example p6|l . imagine that we have found the small z expansion of the generating 
function g{z) = XI -^c^ e''^ that reads 

g{z) = Ao + Aiz + A-^- + .... (BIO) 

ln(— 2;) 

DifRrentiating gives 

g'(z)=A^ + A—^ + ... (Bll) 
ln(— 2;) 

which, in conjuction with ()B9|) . shows that a = 2. Using the asymptotic ()B7|) and g'{z) = 
J2 Cs e'^^ we get as in © 

Cs ^ As'^ilns)"^ as s 00. (B12) 
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